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© Cysteine added variants ("CAVs") of EPO are provided having one or more cysteine residues substituted for 
selected naturally occurring amino acid residues, or inserted into the polypeptide sequence, and preferably 
being further modified by deletion of certain N-terminal amino acids. Such CAVs may be additionally modified 
by the coupling of sulfhydryl reactive compounds to the introduced cysteine residue(s) without loss of bioactivity 
to produce selected homogeneously modified EPO and improved pharmaceutical compositions containing the 
same. 
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Hot Ala Pro Pro Arg Leu He Cya Asp Ser Arg Val 
5 ' ATATG CCA CCA CCA AGA TTA ATT TGT CAT TCT AGA CTA 
TAC OCT GOT GCT TCT AAT TAA ACA CTA AGA TCT CAT 



Leu clu An? Tyr Leu Lea Gin Ala lye Clu Ala Gin Aan 
TTA GAA CGG TAC CTC TTG GAG GCC AAC GAG GCC CAC AAT 
AAT CTT CCC ATG GAG AAC CTC CGG TTC CTC CGG CTC TTA 



23 



He Thr Tor Cly Cys Ala Glu Hia Cyi 8»r Uu X» Clo 
ATC ACG ACC GCC TGT OCT GAA CAC TGC AGC TTG AAT GAG 
TAG TGC TGC CCC ACA CGA CTT CTC ACG TOG AAC TTA CTC 



Aan Ha Thr Val Pro Aap Thr Ly» Val Am Fh* Tyr Ala 
AAT ATC ACT GTC CCA GAC ACC AAA CTT AAC TTT TAC GOG 
TTA TAG TGA CAG GOT CTC TGG TTT CAA TTG AAA ATG CSC 



Trp Lye Arg Mat Clu Val Gly Gin Gin Ala Val Clu Val 
TGG AAA AGA ATG GAA CTT CGC CAG CAG OCT OTA GAA CTA 
ACC TTT TCT TAC CTT CAA CCC GTC CTC CGA CAT CTT CAT 



Trp Gin Gly Leu Ala Leu tan Sar Gin Ala Val Leu Arg 

TGG CAC GGA TTA CCG CTA TTA ACT GAA GCT CTT CTC CGC 
ACC GTC CCT AAT CGC GAT AAT TCA CTT CGA CAA GAG OCO 



Gly Gin Ala Leu Leu Val Aan Sar Bar Gin Pro Trp Gin 
GCT CAG GCT TTA TTA GTC AAC TCT TCC CAG CCG TGG CAG 
CCA CTC CCA AAT AAT CAG TTG AGA ACC GTC GCC ACC CTC 



Pro Leu Gin Leu Hia Val Aap Lya Ala Val Ser Cly Leo 
CCC CTC CAG CTC CAT GTC GAT AAA GCC GTC AGT GCC CTT 
CGG GAT CTC CAC CTA CAC CTA TTT CGG CAG TCA CCC CAA 



Arg Sar Leu Thr Thr Leu Leu Arg Ala Leu Gly Ala Cln 
CCC ACC CTC ACC ACT CTC CTT CCG CCT CTC GGA CCC CAC 
CCG TCG GAC TCC TGA CAC GAA GCC CCA GAC CCT CCC CTC 



Lya Glo Ala Ha Sar Pro Pro Aap Ala Ala Ser Ala Ala 
AAC GAA GCC ATC TCC CCT CCA GAT GCC CCC TCA GCT GCT 
TTC CTT CGG TAG ACG GGA GCT CTA CCC CCG AGT OCA CGA 



Pro Lam Arg Thr Xla Thr Ala Aap Thr Fhe Arg Lya Leu 
CCA CTC CGA ACA ATC ACT CCT GAC ACT TTC CGA AAA CTC 
GOT GAG GCT TCT TAG TCA CGA CTC TCA AAG GCT TTT GAG 



Fhe Arg val Tyr Sar Aan Poa Leu Arg Gly Lya Lea Lya 

TTC CGA CTC TAC TCC AAT TTC CTC CGG GGA AAG CTC AAG 
AAC GCT CAG ATC AGO TTA AAG GAG GCC CCT TTC CAC TTC 



Leu Tyr Thr Cly Cln Ala Cya Arg Thr Gly Aap Arg 
CTC TAC ACA GGG GAG GCC TGC AGC ACA CGG CAC AGA 
GAC ATC TGT CCC CTC CGG ACG TCC TGT CCC CTO TCT 



STOP 

TAA TAATGATACCATCCT 

ATT ATTACTATCCTAGGACATC < 
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TECHNICAL FIELD 

This invention relates generally to polypeptides modified by the attachment of compounds having 
suHhydryl reactive groups, improved methods for producing such modified polypeptides and improved 
5 compositions containing them. The invention relates particularly to three modified polypeptides (IL-3, G-CSF 
and EPO), to which suKhydryl reactive compounds, including polymers, may be attached at selected 
positions in the polypeptide that have been modified by the insertion of cysteine residues or the substitution 
of cysteine residues for other residues. 

10 BACKGROUND 

The desirability of modifying biologically active and therapeutically useful polypeptides with a variety of 
compounds, such as the hydrophilic polymer polyethylene glycol (PEG), to enhance their pharmacokinetic 
properties has been noted. See, e.g., the discussion of the art in this area of polypeptide modification in 
published PCT patent application WO87/00056, in U.S. Pat. No. 4,179,337, which discloses conjugating 
water soluble polypeptides such as enzymes and insulin to PEG or PPG, and in U.S. Pat. No. 4,766,106, 
which discloses conjugating ordinarily water insoluble beta-interferon, interieukin-2, or immunotoxins to PEG 
homopolymers or polyoxyethylated glycerol. Such modification can reduce adverse immune response to 
the polypeptide, increase the solubility for use in pharmaceutical preparations and maintain a desirable 
circulatory level of such polypeptide for therapeutic efficacy. 

One problem not addressed by the art in this area involves the extent to which a polypeptide can be 
modified by attachment of compounds having reactive groups that will covalently bond to certain amino 
acid residues of the polypeptide. For example, modification of a polypeptide with PEG or similar polymers, 
can result in random attachment of the polymer at the amino terminus of the polypeptide and/or at one or 
more lysine residues in the amino acid sequence of the protein. Because more than one PEG group can 
attach to the polypeptide, the resultant composition may contain a heterogeneous mixture of "PEGylated" 
polypeptide; some polypeptides having only one PEGylated site, others having more than one PEGylated 
site. Such heterogeneity in composition is undesirable for pharmaceutical use. Furthermore, the non- 
specificity with regard to the site(s) of attachment of compounds such as PEG to the polypeptide can result 
in loss of biological efficacy of the polypeptide stemming from undesirable attachment to a polypeptide site 
required for biological activity. United States Patent 4,904,584 addresses the foregoing by providing 
materials and methods for site specific covalent modification of polypeptides by lysine insertion, removal, 
and/or replacement. However, we have determined that the use of lysine as the attachment site for 
modification, for example, by PEGylation, may be disadvantageous because not all modifications may result 
in biologically active compounds and because steps must be taken to prevent PEGylation at N-termini in 
cases where N-terminal PEGylation is not desired. 

SUMMARY OF THE INVENTION 

40 This invention provides materials and methods for site specific covalent modification of polypeptides, 
particularly and preferably human IL-3, granulocyte colony stimulating factor (G-CSF) and erythropoietin 
(EPO) polypeptides, permitting the production of compositions comprising homogeneously cys modified IL- 
3s, G-CSFs and EPOs and pharmaceutical compositions containing the same. "Homogeneously cys 
modified" as the term is used herein means substantially consistently modified only at specific, inserted or 

45 substituted cysteine residues. A homogeneously modified IL-3 for example, includes an IL-3 composition 
which is substantially consistently modified at position 6 (using the convention of counting, from the N- 
terminus of the mature protein) by the insertion of cysteine in place of the threonine of natural IL-3, but not 
at other positions. 

Thus, this invention first provides cysteine added variants ("CAVs") of IL-3, G-CSF and EPO. CAVs of 
so this invention encompass IL-3, G-CSF and EPO muteins that contain at least one additional cysteine residue 
compared to the corresponding naturally occurring or previously known IL-3, G-CSF and EPO. The cysteine 
residue(s) are introduced into the peptide structure of the CAVs at one or more amino acid positions in the 
natural or previously known counterpart 

In the case of human IL-3, we have determined that the naturally occurring cysteine residues at 
55 positions 16 and 84 form a disulfide bridge, essential to preserving the desired biological activity of the 
polypeptide. For the addition of novel cysteines, some positions within the polypeptide, such as position 15 
and 51 are unsuitable; cysteines introduced at these positions give rise to human IL-3 polypeptides with 
substantially reduced biological activity. However, certain substitutions or deletions of residues 1-14 do not 
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significantly diminish the desired biological activity of IL-3. Therefore, a preferred region of novel cysteine 
introduction into the polypeptide is within positions 1-14 inclusive. Currently , positions 6-12 inclusive are 
especially preferred sites for cysteine introduction. The subsequent attachment of sulfhydryl reactive 
compounds, including polymers, as discussed below, to the novel cysteines added at selected positions 

5 within this region will not result in any significant loss of biological activity. 

By "cysteine added variant" as the term is used herein, we mean variants of IL-3, G-CSF and EPO that 
are modified in amino acid structure relative to naturally occurring or previously known counterparts such 
that at least one cysteine residue is inserted into the natural or previously known sequence and/or is used 
to replace a different amino acid within that sequence. 

10 Additionally, with respect to IL-3, the native or "natural" IL-3 sequence, with an added initiator 
methionine for bacterial expression, may be further modified such that the first alanine is deleted at the N- 
terminus of the mature polypeptide, altering the amino terminal sequence from MEVALA*PRO to MFPPRO 
(the "mp" mutein). For the "mp" mutein, such N-terminus modification permits more consistent removal of 
the N-terminal methionine. As is already known, in bacterial expression systems, cleavage at the N-terminal 

15 methionine occurs. Likewise, the native EPO N-terminal sequence (with the added MET) begins 
METALATRO and it may prove advantageous to delete the first alanine to obtain an mpEPO mutein. With 
regard to G-CSF, the natural human N-terminal sequence begins with METTHRTRO (with the MET added 
for bacterial production) and it may be desirable to delete this N-terminal threonine to advantageously 
obtain a mpG-CSF mutein. 

20 Alternatively, the natural IL-3 sequence may be further modified such that the first two amino acids at 
the N-terminus of the mature polypeptide are deleted, leaving a terminus beginning with 
METTHR'GLNTHR* (the "m3" mutein). For the "m3" mutein, such N-terminus modification permits one to 
take advantage of the methionine at position 3 in the naturally occurring human IL-3 molecule, as the 
initiator methionine. 

25 The CAVs of this invention make it possible to produce homogeneous, biologically active IL-3, G-CSF 
and EPO compositions substantially specifically and consistently modified at selected positions with 
sulfhydryl reactive compounds (described hereinafter). 

In the practice of this invention, at least one cysteine residue is introduced in that portion of the IL-3, G- 
CSF or EPO polypeptide where modification via a sulfhydryl reactive compound is desired. The cysteine 

30 residue or residues are so introduced by genetic engineering methods as described below. Novel cysteine 
residues may be engineered into the polypeptide for example, by simple insertion of a cysteine codon into 
the DNA molecule at the desired site or by converting a desirably located asparagine or other codon to a 
cysteine codon. Convenient methods for site specific mutagenesis or DNA synthesis for producing a DNA 
molecule encoding the desired CAV, expression in procaryotic or eucaryotic host cells of the DNA molecule 

35 so produced, and recovery of the CAV produced by such expression are also disclosed. 

The CAVs of this invention retain useful biological properties of the natural or previously known protein 
and may thus be used for applications identified for the non-modified parent. Modification with such 
sulfhydryl reactive compounds, however, is preferred. Such biologically active, modified CAVs can be 
produced in homogeneous compositions which, it is contemplated, will provide improved pharmacokinetic 

40 profiles, immunogenicity profiles, and/or solubility characteristics relative to the parent polypeptides. 
Furthermore, CAVs may enable the formation of muKimeric forms of the normally monomeric polypeptide 
with the same, albert improved characteristics. Multimeric CAVs also enable the formation of "hetero- 
conjugates"- i.e. ( two or more distinct polypeptides joined via the sulfhydryl groups of the added cysteine 
residues, e.g., IL-3 joined to EPO or IL-3 joined to G-CSF. 

45 Biological activity of the CAVs before or after modification with the sulfhydryl reactive compounds may 
be determined by standard in vitro or in vivo assays conventional for measuring activity of the parent 
polypeptide. Alternatively, we provide herein a "small scale" screening method wherein successful Cys 
modification and attachment of the sulfhydryl reactive compound may be tested. 

Selective and homogeneous modification of the CAVs with sulfhydryl reactive compounds is possible 

so since such compounds will covalently bond primarily only to the cysteine residue(s) in the CAV. Secondary 
reactivity at His, Lys and Tyr residue(s) may be observed, depending on the choice of sulfhydryl reactive 
compound, but at a significantly lower rate. The modified CAVs so produced may then be recovered! and if 
desired, further purified and formulated into pharmaceutical compositions by conventional methods. 

Sulfhydryl reactive compounds include compounds such as polyalkylene glycol, e.g. polyethylene and 

55 polypropylene glycol, as well as derivatives thereof, with or without coupling agents or derivatization with 
coupling or activating moieties, for example, with thiol, triflate, tresylate, aziridine or oxirane, or preferably 
with S-pyridyl or maleimide moieties. Compounds such as S-Pyridyl Monomethoxy PEG and Maleimido 
Monomethoxy PEG are exemplary. Additionally, sulfhydryl reactive compounds include, but are not limited 
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to, charged or neutral polymers of the following types: dextran. colominic acids or other carbohydrate based 
polymers, polymers of amino acids and biotin derivatives, resulting in a protein modified with this well 
known affinity reagent often used for antibody based assays. 

Briefly, the method comprises reacting the CAV with a sulfhydryl reactive compound under suitable 

5 conditions, preferably non-denaturing conditions, and in sufficient amounts permitting the covalent attach- 
ment of the sulfhydryl reactive compound to the introduced cysteine residue(s) present in the polypeptide 
backbone of the CAV. The reaction may be reducible or non-reducible; and generally, the amount of 
sulfhydryl reactive compound used should be at least equimolar to the number of cysteines to be 
derivatized, although use of excess sulfhydryl reactive compound is preferred, both to improve the rate of 

jo reaction and to insure consistent modification at all reactive sites. The modified CAV produced, may then 
be recovered, purified and formulated by conventional methods. See e.g., WO 87/00056 and references 
cited therein. 

Other aspects of the present invention include therapeutic methods of treatment and therapeutic 
compositions which employ the modified CAVs of the present invention, either alone or with other 
75 lymphokines, hematopoietins and/or growth factors, such as granulocyte macrophage colony-stimulating 
factor (GM-CSF) , macrophage colony-stimulating factor (M-CSF), IL-1, IL-2, IL-4, IL-5. IL-6 and IL-10. These 
methods and compositions take advantage of the improved pharmacokinetic properties of these modified 
CAVs to provide treatments, e.g., such as employing lower dosages * of polypeptide, less frequent 
administration, lower immunogenicity and more desirable distribution, required for the therapeutic indica- 
20 tions for the natural polypeptide. 

Other aspects and advantages of the present invention will be apparent upon consideration of the 
following detailed description of the invention, including illustrative examples of the practice thereof. 

BRIEF DESCRIPTION OF THE DRAWINGS 

25 

Fig. 1 is the human IL-3 gene construct for E. coli expression, having the polypeptide sequence shown 
of natural (wild type) human IL-3, plus an initiator methionine, as expressed in E.coli. with the amino acids 
numbered from the N-terminus for reference to the muteins discussed herein. 

Fig. 2 is the human G-CSF gene construct for E. coli expression, having the polypeptide sequence 
30 shown of natural (wild type) human G-CSF, plus an initiator methionine, as expressed in E. coli with the 
amino acids numbered from the N-terminus for reference to the muteins discussed herein. 

Fig. 3 is a chemically synthesized human EPO gene construct for E. coli expression, having the 
polypeptide sequence of natural (wild type) human EPO. plus an initiator methionine, as expressed in E. coli 
with the amino acids numbered from N-terminus for reference to the muteins discussed herein. 

35 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention involves the selective modification of IL-3, G-CSF and EPO for pharmaceutical 
use, to both enhance their pharmacokinetic properties and provide homogeneous compositions for human 

40 therapeutic use. Although human IL-3, DNA and peptide sequences are preferred as the starting point in 
this invention as it relates to IL-3, any primate IL-3 is susceptible to use in the method of the invention, 
given the significant homology between e.g., human and gibbon species of the protein and DNA. See Leary 
et ah, Blood (1982) 70: 1343-1348. The method for selectively modifying IL-3, G-CSF and EPO involves 
selecting locations in the polypeptide sequence for the attachment of sulfhydryl reactive compounds. This 

45 step may be accomplished by altering the amino acid sequence of the polypeptide by inserting cysteine 
residues at selected sites or by converting selected endogenous residues into cysteine residues. For 
example, the codons AAA or AAG, which code for lysine, can be changed to the codon TGC or TGT, which 
code for cysteine. 

CAVs in accordance with this invention also include allelic variations in the protein sequence, i.e. 

so sequence variations due to natural variability from individual to individual, or with other amino acid 
substitutions or deletions which still retain desirable biological properties of the parent 

All CAVs of this invention may be prepared by expressing recombinant DNA sequences encoding the 
desired variant in host cells, e.g. procaryotic host cells such as E. coli, or eucaryotic host cells such as 
yeast or mammalian host cells, using methods and materials, e.g. vectors, as are known in the art. Host 

55 cells containing and capable of expressing the CAV-encoding DNA are thus encompassed by this invention. 
DNA sequences encoding the variants may be produced synthetically or by conventional site-directed 
mutagenesis of DNA sequences encoding the protein or polypeptide or analogs thereof. Figure 1 shows the 
human IL-3 gene construct inserted in plasmid pAL-hlL3-781 and expressed in the E. coli K1 2 strain 
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designated GI586. This strain containing the plasmid was deposited with the American Type Culture 
Collection, 12301 Parklawn Drive, Rockville. Maryland 20852 USA on April 19. 1989 and given accession 
number 67932. Other DNA sequences for natural primate IL-3 have been cloned and the DNA sequences, 
including cDNA sequences, and specific peptide sequences for the same have been published, in PCT 

s application number US87/01702, published as WO 88/00598 on January 28, 1988, and are therefore known 
in the art. These DNA sequences have been deposited with the American Type Culture Collection and 
given accession numbers ATCC 67154, 67326, 67319 and its replacement 68042, and 40246. DNA 
sequences for natural G-CSF and EPO have been cloned and the sequences and their corresponding 
peptide sequences published and are therefore known in the art 

10 DNA molecules encoding natural human IL-3s, G-CSFs and EPOs therefore may be obtained (i) by 
cloning in accordance with the published methods, (ii) from the deposited plasmids, or (Hi) by synthesis, 
e.g. using overlapping synthetic oligonucleotides based on the published sequences which together span 
the desired coding region. Such methods are known in the art See the foregoing PCT application published 
as WO 88/00598 and PCT application number US88/00402 published as WO88/06161. 

is As mentioned above, DNA sequences encoding individual CAVs of this invention may be produced 
synthetically or by conventional site-directed mutagenesis of a DNA sequence encoding the parental 
polypeptides or analogs thereof. Such methods of mutagenesis include the M13 system of Zoller and 
Smith. Nucleic Acids Res. (1982) 10:6487 - 6500; Methods Enzymol. (1983) 100:468-500; and DNA (1984) 
3:479-488, which uses single stranded DNA and the method of Morinaga et at, Bio/technology (July 1984) 

20 636-639, which uses heteroduplexed DNA. Exemplary oligonucleotides used in accordance with such 
methods are described below. It should be understood, of course, that DNA encoding each of the CAVs of 
this invention may be analogously produced by one skilled in the art through site-directed mutagenesis 
using appropriately chosen oligonucleotides. 

The new DNA sequences encoding the CAVs of this invention can be introduced into appropriate 

25 vectors for heterologous expression in the desired host cells, whether procaryotic or eucaryotic. The activity 
produced by the transiently transfected or stably transformed host cells (or their progeny) may be 
measured by using standard assays conventional for the parental protein. Where the host cell is bacterial, 
the DNA should be free of introns, e.g. a cDNA or synthetic DNA, and may be free of any secretory leader 
sequence. For eucaryotic expression, introns may be present or absent and a secretory leader sequence 

30 should preferably be present 

The CAVs produced by expression in the genetically engineered host cells may then be purified, and if 
desired formulated into pharmaceutical compositions by conventional methods, often preferably by methods 
which are typically used in purifying and/or formulating the parental protein. It is contemplated that such 
pharmaceutical compositions containing the CAV in admixture with a pharmaceutically acceptable carrier 

35 will possess similar utilities to those of the parental proteins, such as those set forth in WO 88/00598 supra , 
at page 3. 

In another, and preferred, aspect of this invention, the CAVs produced by recombinant means as 
mentioned above are reacted with the desired sutfhydryl reactive compound under conditions permitting 
attachment of the sutfhydryl reactive moiety to the sutfhydryl group of the introduced cysteine residues in 
40 the peptide backbone of the CAV. These modified CAVs, preferably produced initially on a small scale, may 
then be screened for bioactive muteins possessing the sutfhydryl reactive compounds attached to the site 
or sites desired. Alternatively, this screening may be accomplished before attachment with the sutfhydryl 
reactive compound. 

The term "sutfhydryl reactive compound" is defined herein as any compound having, or capable of 
45 being activated to have, a reactive group capable of forming a covalent attachment to the sutfhydryl group 
(-SH) of the cysteine residue. Included among such compounds are polymers such as PEG and poly- 
propylene glycol (PPG), dextran, colominic acids or other carbohydrate based polymers and polymers of 
amino acids and biotin derivatives. Activation may occur by modification of the compound with a sutfhydryl 
moiety, such as a sutfhydryl group, thiol, triflate, tresylate, aziridine or oxirane, or preferably, with S-pyridyl 
so or maleimide. The sulfhydryl reactive compound need not have any particular molecular weight but a 
molecular weight of between about 1,000 and 30,000 for the activated compound is preferred, especially for 
PEG. Methods of attachment will be described in detail below. By controlling the number and location of the 
cysteines in the CAV sequence, the number and location(s) of the attached sulfhydryl reactive compound 
can be selectively controlled. Such control of attachment location and number enables the production of 
55 only certain selectively modified molecules retaining the desired biological activity, rather than production of 
a heterogeneous mixture of variably modified molecules, only some of which may be active. It is also 
important to note that this positional selectivity of the PEGylation or other attachment allows the normal 
functional interactions of the protein to be preserved, blocked, or regenerated by release of the sulfhydryl 
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reactive compound. A . _ w , , 

Another aspect of the invention is therefore homogeneous compositions of modified CAVs as described 
herein e.g. PEGylated CAVs. Specific embodiments of IL-3 CAVs of the invention include human IL-3 which 
has a cysteine residue replacing lysine at position 10 and the m3 initiation sequence. (Amino acid numbers 
for the CAVs of the present invention are used herein in the conventional manner, sequentially from the N- 
terminus, and correlate with the numbering system used in Rg. 1 for the natural human IL-3 as expressed 
in E. coli.) Similarly, the naturally occurring lysine residue in human IL-3 (Rg. 1) at amino acid position 100 
ma7 be converted to a cysteine to create a human IL-3 CAV of the invention. Another embodiment has 
cysteine at positions 9 and 10 and the m3 initiation sequence. 

Specific embodiments of G-CSF and EPO CAVs of the invention include human G-CSF which has an 
alanine residue replacing the naturally occurring cysteine residue at position 17 and a cysteine residue 
replacing the naturally occurring alanine at position 37, and human EPO which has a cysteine residue 
replacing serine at position 9. The modification of Ala!7 of G-CSF is made to prevent possible improper 
disulfide bridge formation. 

For bacterial expression where the secretory leader-encoding DNA sequence is removed from the CAV- 
encoding DNA, it may be desirable to additionally modify the sequence such that it encodes an N-terminus 
comprising Met-Prc— (the mp mutein) instead of other N-termini such as Met-Ala-Pro fin IL-3 and EPO) or 
Met-Thr-Pro fin G-CSF). Such N-terminal modification permits more consistent removal of the N-terminal 
methionine. Alternatively, the first two residues of natural, human IL-3 may be deleted, leaving the naturally 
20 occurring methionine at position 3 as the translation initiator (the m3 mutein). 

CAVs of this invention, modified as described, encompass CAVs containing other modifications as well, 
including truncation of the peptide sequence, deletion or replacement of additional amino acids with amino 
acids other than cysteine, insertion of new N-linked glycosylate sites, abolishment of natural N-linked 
glycosylation sites, etc., so long as the bioactivity of the molecule is retained. Thus, this invention 
encompasses CAVs encoded for by DNA molecules which are capable of hybridizing under stringent 
conditions to the DNA molecule encoding the parental IL-3, G-CSF or EPO (or would be so capable but for 
the use of synonymous codons) so long as the encoded polypeptide contains one or more additional 
introduced cysteine residues relative to the parental peptide sequence. Exemplary stringent conditions can 
be found in T. Maniatis, et aL, Molecular Cloning (A Laboratory Manual) , Cold Spring Harbor Laboratory 
(1982), pages 387-389. An example of one such stringent hybridisation condition is 4XSSC at 65°C, 
followed by a washing in 0.1XSSC at 65°C for an hour. Alternatively, an exemplary stringent hybridization 
condition is 50% formamide. 4XSSC at 42°C. 

Because the method and compositions of this invention provide homogeneous modified IL-3s, G-CSFs 
and EPOs, the invention also encompasses such homogeneous compositions for pharmaceutical use which 
comprise a therapeutically effective amount of a modified CAV described above in admixture with a 
pharmaceutically acceptable carrier. Such composition can be used in generally the same manner as that 
described for the natural or recombinant polypeptides. It is contemplated that the compositions will be used 
for treating a variety of conditions, e.g. involving stimulating hematopoiesis or improving a patient's 
hematological profile. For example, a modified IL-3 of the present invention may be used as an adjunct to 
cancer chemotherapy, radiation therapy, or in the treatment of immune disorders, as discussed in WO 
88/00598, at page 17-19. The exact dosage and method of administration will be determined by the 
attending physician depending on the particular modified CAV employed, the potency and pharmacokinetic 
profile of the particular compound as well as on various factors which modify the actions of drugs, for 
example, body weight sex. diet, time of administration, drug combination, reaction sensitivities and severity 
of the particular case. Generally, the daily regimen should be in the range of the dosage for the natural or 
recombinant unmodified protein, e.g. a range of about 0.1 to about 100 ug of polypeptide per kilogram of 
body weight preferably from about 0.1 to about 30 ug of polypeptide per kilogram of body weight 

The therapeutic method and compositions of the present invention may also include co-administration 
with other drugs or human factors. A non-exclusive list of other appropriate hematopoietins, CSFs (colony 
stimulating factors) and interteukins for simultaneous or serial co-administration with the CAVs of the 
present invention includes GM-CSF, CSF-1 fin its various known forms; CSF-1 is also referred to as M-CSF 
or macrophage colony-stimulating factor), Meg-CSF, IL-1, IL-2. IL-4. IL-6, IL-10, B-cell growth factor, B-cell 
differentiation factor and eosinophil differentiation factor. Additionally, the CAVs of the present invention may 
be administered with, or chemically attached to, monoclonal or polyclonal antibodies in a therapeutic use. 
Alternatively, these growth factors may be attached to certain toxins, e.g., ricin, for use in a therapeutic 
regimen. The dosage recited above would be adjusted to compensate for such additional components in 
the therapeutic composition or regimen. In the case of pharmaceutical compositions containing modified 
lymphokine CAVs, for example, progress of the treated patient can be monitored by periodic assessment of 
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the hematological profile, e.g. white cell count, hematocrit and the like. 

The following examples illustratively describe the CAVs and the methods and compositions of the 
present invention. 

5 EXPERIMENTAL MATERIALS, METHODS AND EXAMPLES 

EXAMPLE 1: Eucaryotic Expression Materials and Methods 

Eukaryotic cell expression vectors into which DNA sequences encoding CAVs of this invention may be 
10 inserted (with or without synthetic linkers, as required or desired) may be synthesized by techniques well 
known to those skilled in this art. The components of the vectors such as the bacterial replicons, selection 
genes, enhancers, promoters, and the like may be obtained from natural sources or synthesized by known 
procedures. See Kaufman et a!., J. Mol. Biol. . (1982) 159: 601-621, Kaufman, Proc. Natl. Acad. Sci. (1985) 
82:68^693. See also WO 87/04187, filed January 2, 1987 {pMT2 and pMT2-ADA), and US Patent 
j 5 Application Serial No. 88,188, filed August 21, 1987 (pxMT2). Exemplary vectors useful for mammalian 
expression are also disclosed in the patent applications cited in Example 4, which are hereby incorporated 
by reference. Eucaryotic expression vectors useful in producing variants of this invention may also contain 
inducible promoters or comprise inducible expression systems as are known in the art. See US Patent 
Application Serial No. 893,115 (filed August 1, 1986) and PCTAJS87/01871, published as WO88^00975 on 
20 February 11,1988. 

Established cell lines, including transformed cell lines, are suitable as hosts. Normal diploid cells, cell 
strains derived from in vitro culture of primary tissue, as well as primary explants (including relatively 
undifferentiated cells such as hematopoietic stem cells) are also suitable. Candidate cells need not be 
genotypically deficient in the selection gene so long as the selection gene is dominantly acting. 

26 If eucaryotic host cells are used, they will preferably will established mammalian cell lines. For stable 
integration of the vector DNA into chromosomal DNA, and for subsequent amplification of the integrated 
vector DNA, both by conventional methods, CHO (Chinese Hamster Ovary) cells are presently preferred in 
such embodiments. Alternatively, the vector DNA may include all or part of the bovine papilloma virus 
genome (Lusky et al. a Cell (1984) 36: 391-401) and be carried in eel! lines such as C127 mouse cells as a 

30 stable episomal element. Other usable mammalian cell lines include HeU COS-1 monkey cells, melanoma 
cell lines such as Bowes cells, mouse L-929 cells, 3T3 lines derived from Swiss, Balb-c or NIH mice, BHK 
or HaK hamster cell lines and the like. 

Stable transformants then are screened for expression of the CAV product by standard immunological 
or activity assays. The presence of the DNA encoding the CAV polypeptides may be detected by standard 

35 procedures such as Southern blotting. Transient expression of the CAV genes during the several days after 
introduction of the expression vector DNA into suitable host cells such as COS-1 monkey cells is measured 
without selection by activity or immunologic assay of the proteins in the culture medium. 

Following the expression of the DNA by conventional means, the CAVs so produced may be recovered, 
purified, and/or characterized with respect to physicochemical, biochemical and/or clinical parameters, all by 

40 known methods. 

EXAMPLE 2: Bacterial and Yeast expression 

Bacterial and yeast expression may be effected by inserting (with or without synthetic linkers, as 
45 required or desired) the DNA molecule encoding the desired CAV into a suitable vector (or inserting the 
parental DNA sequence into the vector and mutagenizing the sequence as desired therein), then transform- 
ing the host cells with the vector so produced using conventional vectors and methods as are known in the 
art, e.g. as disclosed in published PCT Application No. WO 86/00639. published January 30. 1986. 
Transformants are identified by conventional methods and may be subcloned H desired. Characterization of 
so transformants and recombinant product so produced may be effected and the product recovered and 
purified, all as described in Example 1 . 

For bacterial expression, the DNA sequences encoding the CAVs are preferably modified by conven- 
tional procedures to encode only the mature polypeptide and may optionally be modified to include 
preferred bacterial codons. 

55 
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Expression in E. coli : 

The IL-3 CAVs of Example 5 were expressed in E. coli as follows: Plasmid pAL-hlL3-781 was 
transformed into an E. coli K12 strain GI586, a derivative of strain W31 10 in which the d and Rex regions of 
bacteriophage lambda carrying the C,857 allele have been inserted into the Clal site of the lacZ gene of the 
bacterial genome. This insert consists of all of the DNA sequences between nucleotides 35711 and 38104 
of the phage genome. See F. Sanger, et al., J. Mol. Biol. (1982) 162:729. E. coli K12 strain GI586 (pAL-hlL3- 
781) was deposited at the ATCC on April 19, 1989 and given accession number 67932. 

When GI586 transformed with pAL-hlL3-781 is grown at 30 degrees centigrade to high cell density and 
then heated to 40 degrees centigrade, IL-3 is produced rapidly and accumulates over the next two or three 
hours to reach greater than 10 percent of the total cellular protein. This protein is produced in an insoluble 
form which must be solubilized and refolded by conventional methods. See, e.g., T.E. Creighton, Prog. 
Biophys. Molec. Biol. (1978) 33231-297. Following expression, the CAVs so produced were recovered, 
purified and characterized as follows. 

The G-CSF CAV of Example 5 and the EPO CAV of Example 6 were expressed similarly in E. coli by 
removing the DNA sequence encoding CAV IL-3 from pAL-hlL3-781 and inserting the appropriate G-CSF or 
EPO DNA sequence as set forth in those Examples. Transformations were carried out under the same 
conditions set forth above. 

1 . Purification of CAV IL-3 

All buffers were prepared using glass distilled water; all were degassed for at least five minutes, using 
house vacuum/sonification, prior to the addition of DTT. 

First. 400 grams wet weight frozen E. coli cell paste was suspended in 2500 ml of buffer containing 50 
mM Tris-HCI, pH 8.5, 1 mM EDTA, 5 mM P-aminobenzamidine, 1 mM PMSF and 2 mM DTT (hereinafter in 
this Example "buffer A"), to obtain a final volume of 2850 ml. Glass rods and magnetic stirrers were used to 
resuspend the cell paste. Then the cell suspension was lysed by passing it through a matin gaulin valve at 
9000 psi four times, with cooling between each time. Temperature was maintained below 30 degrees 
centigrade by collection of the lysate into glass vessels cooled in iceJwater mixture. Protein concentration 
was 22 mg/ml; final volume was 2850 ml. 

The lysate was centrifuged for 30 minutes at 8000 rpm in a Sorval centrifuge with a GS-3 rotor. The 
supernatant (2600 ml at 17.0 mg/ml) was discarded and the resultant pellet (hereinafter in this Example PI) 
from this centrifugation was resuspended in approximately 400 ml buffer A, using glass rods and a 
magnetic stirrer. The milky suspension was then passed through an 18 gauge needle using a 60 ml syringe. 
The final volume was 640 ml, with a protein concentration of 25.6 mg/ml. 

The resuspended P1 pellet was then centrrfuged for 10 minutes in a Sorval centrifuge with a GS-3 rotor 
at 8000 rpm. The supernatant from this centrifugation was poured into two fresh centrifuge tubes 
(hereinafter in this example "S2") and the resultant pellet ("P2") was resuspended in buffer A to a final 
volume of 165 ml, with a protein concentration of 50 mg/ml. The S2 supernatant was then centrrfuged for 10 
minutes and the resulting pellet ("P3") was resuspended in 65 ml buffer A. The resulting supernatant ("S3") 
was further centrifuged for 10 minutes and the resulting P4 pellet was resuspended in buffer A to a final 
volume of 50 ml, with a protein concentration of 11.3 mg/ml. Because the P4 pellet contained so little IL-3, it 
was not used in subsequent steps. The S4 supernatant from the final centrifugation, approximately 600 ml, 
contained the membranous components at a concentration of approximately 10 mg/rnl. 

The P2 and P3 pellets were pooled and centrifuged at 9000 rpm (GSA rotor) for 10 minutes yielding 
two pellets ("P2-2") and a cloudy supernatant, which using HPLC analysis was found void of IL-3 and was 
discarded. The P2-2 pellet was frozen at -20 degrees centigrade for later use. 

The frozen P2-2 pellet was then resuspended in buffer A (which contained 10 mM DTT rather than 2 
mM DTT) to a final volume of 100 ml using glass rods and magnetic stirrer and then passed through an 18 
gauge needle. 400 ml of 7 M fresh guanidine in the 10 mM DTT buffer A was added to the resuspended 
P2-2 and after one quick inversion, the solubilized P2-2 pellet was immediately placed in 3 x 250 centrifuge 
tubes and centrifuged for 15 minutes at 8000 rpm (GSA rotor). 500 ml of the supernatant at a concentration 
of 5.98 mg/ml was purified further at room temperature by RP-HPLC. The foregoing two steps were 
performed in 17 to 22 minutes. 

This purification protocol may be applied to the purification of the G-CSF and EPO CAVs expressed in 
E. coli as set forth above with similar results. 
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2. RP-HPLC separation of lL-3 CAVs 

The buffers used in this separation protocol were 0.1% (v/v) TFA in water, and 0.1% TFA in acetonitrile. 

A two inch Vydac C4 column was equilibrated in 10% acetonitrite. The supernatant from the 7 M 
guanidine solubilization was immediately applied onto the C4 column having a volume of approximately 470 
ml at 180 ml per minute. The column was developed at 20 ml per minute and was washed in 10% 
acetonitrite until absorbance at 280 nm was back to baseline. The following gradient was established by 
washing with the following concentrations of acetonitrile at the following times: 



time (in minutes) 


% acetonitrile 


5 


10 


10 


35 


55 


55 


60 


80 


65 


80 


67.5 


10 



40 ml fractions were collected after 35 minutes into the gradient. 10 ill samples were removed from 
each fraction, vacuum speed dried and taken up in 20 ul of 2x SDS-sample Laemmli buffer. SDS-PAGE 
analysis was performed and IL-3 presence was confirmed. All fractions were then frozen at-80 degrees 
centigrade 

Similarly, RP-HPLC separation of G-CSF and EPO CAVs may be accomplished. 

3. Refolding of iL-3 CAVs 

One of the RP-HPLC separated fractions containing approximately 75 mg (7.5 ml) IL-3 was diluted to 
approximately 0.5 mg/ml by the addition of 142.5 ml of 6.4 M guanidine in 50 mM NaPO* pH 7.0, 1 mM 
EDTA and 02 mM DTT. The mixture was then added to 750 ml of 50 mM NaPO* pH7, 1 mM EDTA, 02 
mM DTT buffer, transferred to dialysis tubing and dialyzed for two hours against 4 L of the same buffer. 
The IL-3 (now approximately 0.22 M guanidine) was twice further dialyzed against 8 L of the same buffer 
containing 0.1 mM DTT. 

PEGylation of this purified IL-3 is set forth in Examples 7 and 8 below. 

Refolding of G-CSF and EPO CAVs may be accomplished in the same manner. 

4. Confirmation of bioactrvtty 

Bioactivtty of IL-3 CAVs or EPO CAVs may be confirmed by using the TF-1 cell proliferation assay. The 
TF-1 cell line has been described (Kitamura et al. f Blood (1989) 73:375-380). 

In that assay, cells are maintained at 37 degrees centigrade in humid air containing 5% CQ2 and 
culture media used is RPMI (Gibco). 10% heat inactivated fetal calf serum, 2 mM L-glutamine, with 5 ng/ml 
of recombinant GM-CSF added. Every 3-4 days cells are adjusted to a density of 2 x 10* cells/ml. Just 
prior to assay the cells are centrifuged 500xG, 5 minutes, washed in culture media without rGM-CSF, 
recentrifuged and resuspended at a density of 10 5 cells/ml. 

IL-3 or EPO samples to be assayed are diluted between 1:500 and 1:10,000 in culture media without 
rGM-CSF. 125 ul of the diluted sample is placed in the top row of a 96 well microtiter plate. The remaining 
wells are filled with 100 ul of culture media without rGM-CSF and the top row samples are serially diluted 
five fold down the microtiter plate. To each well. 100 ul of diluted cells (10 4 cells) are added and the plate 
is incubated at 37 degrees centigrade, 5% C° 2 for 48-72 hours. Thereafter, 0.5 uCi 3 H-thymidine is added 
per well and the plate is further incubated for 4-6 hours. Cells are then harvested using ah automated cell 
harvester (LKB 1295-001) and the 3 H-thymidine uptake is quantitated. 

Alternatively for IL-3. a CML proliferation assay as described in PCT/US87/017024. International 
Publication Number WO88/00598, published January 28, 1988, can be used. 

Bioactivtty of G-CSF CAVs may be confirmed using the 32D murine cell line, as described in Hapel. et 
al., Blood (1984) 64:786-790, and adding 5% v/v of WEHI-3 conditioned media from a 48 hour culture of 
V\Er¥3cells, ATCC TIB68, (1x10* cells/ml) in RPMI-1640 media supplemented with 2 mM L-glutamine 
instead of the 5 ng/ml recombinant GM-CSF in the TF-1 cell proliferation protocol. The wash and assay 
steps are then carried out in the absence of WEHW conditioned media. 
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EXAMPLE 4: Mutagenesis Protocol 

Site directed mutagenesis may be effected using conventional procedures known in the art. See e.g., 
International Applications Nos. WO 87/07144, and WO 87/04722, and US Patent Application Serial Nos. 
s 099,938 (filed September 23, 1987) and 088,188 (filed August 21, 1987) and the references cited therein. 

EXAMPLE 5: Exemplary Mutagenesis Reactions 

The following human IL-3, G-CSF and EPO muteins were engineered by substitution of the codons 
io indicated for a cys codon, or by insertion of a cys codon, using conventional site directed mutagenesis 
techniques: 
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IL-3 



top route in 

mpCyslO 

mpCys6 

mpCys8 

mpCysl2 

rapCyslOO 

mpCysl34 

mpCys3 
mp&lCysl9 



m3 mutein 

m3cyslO 

m3Cys6 

m3Cys8 

m3Cysl2 

m3CyslOO 

m3Cysl34 



m3Cys6, 10 

m 3 Cys 9, 10 
m3Cys6, 8 



cys modification 

AAA to TGC (Lys to Cys) 

ACT to TGC (Thr to Cys) 

TCT to TGC (Ser to Cys) 

TCT to TGC (Ser to Cys) 

AAG to TGT (Lys to Cys) 

Insertion of TGT between 
TTC and TAG (Cys between 
Phe 133 and stop codon) 

ATG to TGC (Met to Cys) 

Replacement of amino acids 

1-15 with the "mp" 

terminus and modif. of 

pos. 19 from ATG to TGC 
(Met to Cys) 

ACT and AAA to 
TGC (Thr and Lys 
to Cys) 

TTA and AAA to TGC (Leu 
and Lys to Cys) 

ACT and TCT to TGC (Thr 
and Ser to Cys) 



m3Cys6,8,10 ACT, TCT and AAA to TGC 
(Thr, Ser and Lys to Cys) 

m3Cys8,9,10 TCT, TTA and AAA to TGC 
(Ser, Leu and Lys to Cys) 



G-CSF 



mp mute in 
mpAlal7Cys37 



cys modification 

GCC to TGC (Ala to Cys) 



EPO 



mp mutein 
mpCys9 



cys modification 
TCT to TGT (Ser to Cys) 



In the examples depicted above the modification she of the natural IL-3. G-CSF or EPO protein is 
designated by the number after "Cys" and the amino acid sequence of the CAV is identical to that of the 
native protein, except for the position indicated, with respect to the N-terminus (see Fig. 1). The "mp" and 
"m3" designations signify the two different alterations of the N-terminus that will be discussed in detail 
below. Additionally, cys may be introduced in place of native IL-3 codons, for example at positions 63 or 66, 
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alone or in combination with other cys introduction(s), e.g. at position 10-with any of the descnbed N- 
termini. Contemplated EPO muteins include EPO mpCys166. having the mp N-terminus and the native 
arginine at position 166 deleted and replaced with cysteine, and mpCys24Cys38Cys83. having the mp N- 
terminus and the three N-linked glycosylation sites at the asparagine amino acids replaced with cysteines. 

With respect to IL-3 muteins. certain point modifications may result in partial loss of biological activity 
or inability of the sulfhydryl reactive compound to attach. For example, modification at position 28 results in 
a biologically active CAV. but attachment of a sulfhydryl reactive compound fails, possibly because position 
28 appears internally in the refolded CAVs tertiary structure. Compare. WingfiekJ, D., et al„ Eur. J. Biochem. 
(1989) 179:565-571. in which the authors discussed the Cys modification of IL-10 at position 138 to active 
iL-10-phycoerythrin conjugate. Additionally, we have found that substitution of a cysteine residue for the 
amino acids at positions 15 or 51 of the natural human IL-3 may result in partial loss of bioactivrty. To test 
for activity after attachment of the sulfhydryl reactive compound, this invention further provides a "small 
scale" screening technique to readily determine whether modification and attachment has been successful 
(see Example 9 below). 

The human IL-3 was additionally modified at its N-terminus in two different and alternative configura- 
tions, represented by the "mp" and "m3" designations. The "mp" designation indicates a deletion of the 
first alanine in the natural human IL-3 protein, thereby changing the N-terminaJ sequence from 
MET"ALA*PRO to METTRO. The "m3" designation indicates a deletion of the first two amino acids in the 
natural human IL-3 protein, METALAVRO, to yield a terminus beginning METTHR'GLLPTHR*. The reasons 
for these modifications have already been discussed. With respect to N-terminus modification of the IL-3 
mpA1Cys19 mutein. amino acids 1-15 were deleted and replaced with the "mp" terminus. TTie human G- 
CSF and EPO muteins were additionally modified at the N-terminus by deletion of the first amino acid to 
obtain "mp" muteins. 

K should be understood of course that the depicted list of muteins is merely exemplary and not 
exclusive. The design and synthesis of alternative and additional muteins in accord with this invention is 
well within the present skill in the art. Synthesis of such muteins may be conveniently effected using 
conventional techniques and methods. 

One skilled in the art, of course, could readily design and synthesize other muteins for substitution of 
cysteine codons or insertion thereof in DNA sequences encoding IL-3. G-CSF and EPO. To modify more 
than one site, mutagenesis may be carried out iteratively. or in some cases using an oligonucleotide 
designed for mutagenesis at more than one site. 

EXAMPLE 6: Synthesis of DNA molecules encoding CAVs 

As an alternative to the production of CAV-encoding DNA by mutagenesis of the parental DNA 
sequence, it should be understood that the desired CAV-encoding DNA may be prepared synthetically. In 
that case, it will usually be desirable to synthesize the CAV DNA in the form of overlapping 
oligonucleotides, e.g. overlapping 50-80mers. which together span the desired coding sequence and contain 
the cysteine additions desired: 



An exemplary EPO mutein made in accordance with this Example is EPO mpCys9. The chemically 
synthesized cDNA shown in Figure 3 is assembled, modified at position 9. and purified using techniques 
know to those skilled in the art See Wosnick, et al., Gene 60 :115-127 (1987); see also U.S. Patent No. 
4,904.584. The synthetic cDNA is designed with "overhanging end" nucleotide sequences compatible with 
those generated by the restriction enzymes, Ndel and XbaJ. The purified, synthetically derived, cDNA is 
ligated with the purified Ndel-Xbal vector portion of plasmid pAL-hlL3-781 , resulting in the replacement of 
human IL-3 cDNA with human erythropoietin cDNA EPOmpCys166 and EPO mpCys24Cys38Cys83 can be 
made in the same manner. 

Given a desired coding sequence, the design, synthesis, assembly and ligation, H desired, to synthetic 
linkers of other appropriate oligonucleotides is well within the present level of skill in the art 
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EXAMPLE 7: PEGylation of the IL-3 mpCyslO mutein 

The mutein human IL-3 mpCyslO was prepared in accordance with Example 5 above and PEGylated 
with two PEG 5000 derivatives. S-Pyridyl Monomethoxy PEG 5000 (PEG 5000 SPDP) and Maleimido 
5 Monomethoxy PEG 5000 (PEG 5000 SMCC). 

1. PEGylation with S-Pyridyl Monomethoxy PEG 5000: a reducible linkage 
a.) Preparation of the suffhydryl reactive compound. 

'° PEG 5000 was activated for attachment to a sulfhydryl group as follows. 2.0 grams of Monomethoxy 
PEG 5000 amine was dissolved in 12 ml dry peroxide free, dioxane. 144 mg (15% excess) of N- 
succinimidyl-3-(2 pyridyldithio) propionate (SPDP) was added as a dry powder and the reaction was allowed 
to proceed at room temperature. After 24 hours, the S-pyridyl Monomethoxy PEG 5000 product was 

is precipitated using dry, peroxide free diethyl ether and washed with ether. The product was dried under 
vacuum to obtain 1.92 grams of white solid, which was identified as S-Pyridyl Monomethoxy PEG 5000 by 
NMR and IR. The PEG 10,000 analog (PEG 10,000 SPDP) was likewise prepared via an analogous 
procedure. 

20 b.) PEGylation of mutein C10 human IL-3. 

For this coupling, natural (wild type) human IL-3 was also treated with the PEGylation reagents as a 
negative control. A stock solution at 1 mg/ml of the mpCyslO mutein in a pH 7 buffered solution of 50 mM 
NatePO* 100 micro M DTT, 1 mM EDTA and about 3mM Guanidine HCI was used. DTT was added to 

25 prevent dimerization of the protein; EDTA was added to prevent dimerization via metal mediated oxidative 
coupling Guanidine remains as an artifact of the refolding of the protein. A pH 7 was used; a range of 6.5- 
7 5 is preferred. 0.9 mg of S-Pyridyl Monomethoxy PEG 5000, prepared as set forth above, was weighed 
into an Eppendorf tube. 360 microliters of the buffered mutein was added and the mixture was vortexed 
briefly to homogeneity. The reaction was performed at 4 degrees centigrade and when sampled after 2 

30 hours, was found to be complete. Analysis on a 10-20% gradient SDS acrylamide gel stained with 
Coomassie blue showed the product as nearly pure and running at about 28 kD. (By comparison. mpCyslO 
and its dimer were used as standards and found to migrate to 15 and 30 kD respectively.) A reducing lane 
on the gel showed that the PEGylated IL-3 mutein is sensitive to reduction by DTT and regenerated the 
original protein at about 15 kD. 

35 

2. PEGylation with Maleimido Monomethoxy PEG 5000: a non-reducible linkage 
a.) Preparation of the sulfhydryl reactive compound. 

40 In this experiment PEG 5000 activation was accomplished as follows. 2.0 mg of monomethoxy PEG 
5000 amine was dissolved in 12 ml of dry, peroxide free dioxane. 154 mg of sulfosuccinimidyl 4-(N- 
maleimidomethyi) cyctohexane-1-carboxylate (SMCC) (15% excess) was added as a dry powder and the 
reaction was allowed to proceed at room temperature. After 24 hours, work up of the product was earned 
out in the same manner as the S-Pyridyl Monomethoxy PEG 5000 to obtain 1.82 grams Maleimido 

45 Monomethoxy PEG 5000. The PEG 10.000 analog was prepared similarly. 



b.) PEGylation of the cys10 IL-3 mutein. 

This reaction was carried out in the same manner as the PEGylation reaction using the reducible PEG 
so 5000 reagent and with natural human IL-3 as a negative control. However. 1.0 mg of the PEG derived 
PEGylating agent Maleimido Monomethoxy PEG 5000 was used. 400 microliters of the mpCyslO IL-3 
mutein was added and vortexed to homogeneity. Ait =2 hours the reaction was found to be complete. The 
product was nearly pure and indistinguishable from the S Pyridyl derived conjugate in molecular weight 
However, this product is perfectly inert to reductive conditions, such as DTT; in this reducing lane the 
55 product at 28 kD. persists. 

In both control reactions, nothing indicative of conjugation is evident at 2 hrs or even at 24 hrs. 
Selectivity for accessible sulfhydryls in this chemistry is therefore very high. 
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EXAMPLE 8: PEGylation of multiple cysteine muteins m3Cys9,10 and m3Cys6 t 10 

In this experiment, protein stock for both muteins was at 300 ug/ml in the phosphate buffer solution, as 
described in Example 7. PEGylation stock solutions consisted of the S-pyridyl or Maleimide activated PEG 

5 5000 polymers at 50 ug/ml in the same buffer. To initiate the reaction, 11 ul of the appropriate PEG stock 
was added to 100 ul of the appropriate protein stock (either the m3Cys9.10 mutein or the m3Cys6.10 
mutein) white vortexing. Reactions were allowed to proceed at 4 degrees centigrade overnight. SDS get 
analysis of the products as described above revealed that only a trace of starting material remained with 
both chemistries. Furthermore, both chemistries resumed in new products with a gel mobility of about 37 kD. 

70 Reducing lanes on this same gel show that the maleimide conjugate is resistant to reducing, while the S- 
Pyridyl derived conjugate reverts to starting material. 

EXAMPLE 9: PEGylation of the G-CSF mpAla17Cys37 mutein 

75 The mutein human G-CSF mpAla17Cys37 was prepared in accordance with Example 5 above and 
PEGytated with PEG 5000 SPDP, PEG 5000 SMCC and PEG 10,000 SPDP. The natural cysteine at position 
17 was deleted and replaced with alanine to prevent possible improper disulfide bridge formation. The 
mpAla17Cys37 G-CSF protein stock was dialyzed into a pH 7.0 buffering solution of 50mM Nar^PO* and 
1mM EDTA, 100uM DTT and concentrated to approximately 100ug/ml. Total volume was 670 ul. 

20 Stocks of the PEGylation reagents (PEG 5000 SPDP, PEG 5000 SMCC, and PEG 10,000 SPDP) were 
made up fresh at 10mM in H2O (18 x stock solutions prepared as described in Example 7, parts la and 
1b). The reactions were carried out using the reagents and amounts set forth below. 



Rxn# 


PEG Reagent 


Stock 


ul Protein 


Ul PEG 


1 


5000 SPDP 


170 


10 


2 


5000 SMCC 


170 


10 


3 


10,000 SPDP 


170 


10 



All reactions were ImM in PEG reagent. As in Example 7, the mutein was added to the PEG reagent 
and vortexed to homogeneity. The reactions were allowed to proceed at 4°C overnight and found to be 
complete. 

35 Analysis on a 10-20% gradient SDS acrylamide gel with both reducing and non-reducing lanes and 

stained with Coomasste blue showed the mpAla17Cys37 PEG 5000 SPDP and mpAIa17Cys37 PEG 5000 

SMCC products as nearly pure and running at about 28 kD. 

Non-PEGylated G-CSF mute in runs at approximately 19kD. A reducing lane on the gel showed that the 

SPDP conjugate is sensitive to reduction in the presence of DTT. The SMCC derived reagent was totally 
40 resistant to reductive treatment The PEG 10.000 SPDP PEGylated mpAla17Cys37 G-CSF runs at 

approximately 32 kD. The non-PEGylated G-CSF mutein is regenerated by DTT treatment 

EXAMPLE 10: PEGylation of the EPO mpCysO mutein 

45 The mutein human EPO mpCys9 prepared in accordance with Example 5 above can be PEGylated 
using the stock PEGylation reagents prepared in Example 9 and the natural human EPO as a negative 
control. To initiate the reaction, 10 ul of the appropriate PEG stock as prepared in Example 9 is added to 
170 ul of the EPO mpCys9 stock at a concentration of 100ug/ml. The mixture is vortexed and the reaction 
is allowed to proceed at 4°C overnight Upon completion, the reaction can be analyzed as in Example 9. 

so 

EXAMPLE 11: Screening of novel CAVs 

Having the constructed novel DNA molecules encoding CAVs in the appropriate expression vector and 
having attached the suHhydryl reactive compound to the muteins, it may be desirable to produce each CAV 
55 protein on a small scale and "screen" for muteins which possess the desired attachment site or sites. The 
biological activity of each CAV, before and after attachment of the sulfhydryl reactive compound can be 
rapidly assessed using an in vitro assay. 
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Small scale bacterial production of IL-3 CAV muteins 

Bacterial strain GI586 was transformed with purified plasmid DNAs consisting of bacterial expression 
vector, P AL-hlL3-781. ATCC Accession Number 67932, with novel CAV IL-3 coding sequences. The 
transformed cells were spread on LB agar plates containing 50 ug/ml ampicillin at a density to yield 
approximately 100 colonies per plate. 3 ml of L broth plus 50 ug/ml ampicillin was inoculated with a single 
bacterial colony and grown overnight at 30 degrees centigrade. 50 ml of induction media (0.1x L Broth, 
1xM9 salts. 0.4% glucose, 1mM MgSO*. 50 ug/ml ampicillin) was inoculated with 1 ml of the overnight 
culture. The 50 ml culture was grown with aeration at 30 degrees centigrade until an 0.5 OD 600 nm level 
was reached, then the temperature was shifted to 40 degrees centigrade and growth continued for at least 2 
hours. 

Cells were then harvested by centrifugation at 3500 rpm for 5 minutes in a Sorval centrifuge with a 3B 
rotor. The supernatant was discarded and the cell pe»et resuspended in 1 ml of buffer PED (50 mM 
NaHjPOi. pH 7.0, 1 mM EDTA, 5mM DTT, 10 mM PMSF). This 1 ml solution was passed twice through a 
French Press at 10.000 psi and kept on ice. The solution was then microfuged for 5 minutes at 12,000 rpm. 
The supernatant was discarded and the pelleted material was resuspended in 150 Hi of 7 M guanidine-HCI 
in PED buffer. The solution was then diluted with 650 nl of PED buffer and placed in dialysis tubing (10,000 
MWCO Spectrapore). The sample was dialyzed for at least 4 hours against 2 liters of PED.1 buffer (50 mM 
NaH2P0 4 pH 7.0, 1 mM EDTA, 0.1 mM DTT). The sample was collected and microfuged to remove 
precipitated proteins. The sample was then analyzed on a 12% Laemmli SDS PAGE gel and the amount of 
IL-3 protein estimated. 

The protein solution was then concentrated to about 0.5 mg/ml, and 200 ug was reacted with a 15 fold 
molar excess of either S-Pyridyl Monomethoxy PEG 5000 or Maleimido Monomethoxy PEG 5000 for 
several hours at 4 degrees centigrade. The products were then analyzed by SDS PAGE and biological 
activity determined by an in vitro TF-1 cell proliferation assay. 

This small scale production methodology may be similarly advantageously applied to production of the 
novel CAV G-CSFs and EPOs of the present invention. 

Alternatively, this small scale production for screening may be carried out before attachment of the 
surfhydryl reactive compound. In that case, biological activity may still be determined by an in vitro TF-1 
cell proliferation assay (or in the case of G-CSF, a 32D cell proliferation assay) and the products may be 
analyzed by SDS PAGE analysis, in accordance with known techniques. 

The same or similar procedures may be used by one skilled in the art to attach other surfhydryl reactive 
compounds to the other CAVs of the invention. Homogeneity can be observed by conventional analysis of 
the modified CAVs so produced e.g. using standard SDS-PAGE or HPLC analysis. 

Numerous modifications may be made by one skilled in the art to the methods and compositions of the 
present invention in view of the disclosure herein. Such modifications are believed to be encompassed by 
this invention as defined by the appended claims. 

Claims 

1. A cysteine added variant ("CAV") of EPO, characterized in that said CAV comprises a peptide 
sequence of human EPO modified to contain at least one non-native cysteine residue at which residue 
said CAV is covalently attached to at least one surfhydryl reactive compound. 

2. A CAV of claim 1 , characterized in that the N-terminus of the CAV commences with methionine and the 
native amino acid at position 1 of the mature peptide sequence is deleted. 

a A CAV of any of claims 1 to 2, characterized in that said at least one surfhydryl group is selected from 
one or more of the group consisting of dextran, a carbohydrate based polymer, such as a colominic 
acid, a polymer of an amino acid, biotin, and a polyalkylene glycol moiety. 

4. A CAV of any of claims 1 to 2, characterized in that said at least one surfhydryl reactive compound is 
polyetrryiene glycol. 

5. A DNA sequence encoding a CAV comprising a peptide sequence of human EPO modified to contain 
at least one non-native cysteine residue for attachment of at least one sutfhydryl reactive compound. 

6. A host cell containing and capable of expressing a DNA sequence of claim 5. 
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A method of producing a CAV of any of claims 1 to 4, comprising covalently attaching at least one 
surfhydryl reactive compound to at least one non-native cysteine of said CAV produced by culturing a 
host cell containing and capable of expressing a DNA sequence encoding said CAV. 

Use of a CAV of any of claims 1 to 4 for the preparation of pharmaceutical compositions suitable for 
stimulating hematopoiests. 
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FIGURE 1 



1 " 
ATG GCT CCT ATG ACT CAA ACT ACT TCT TTA AAA ACT TCT 
Met Ala Pro Met Thr Gin Thr Thr Ser Leu Lys Thr Ser 

TGG GTA AAC TGT TCT AAC ATG ATC GAT GAA ATT ATA ACA 
Trp Val Asn Cys Ser Asn Met He Asp Glu He He Thr 



CAC TTA AAG CAG CCA CCT TTG CCC TTG CTG GAC TTC AAC 
His Leu Lys Gin Pro Pro Leu Pro Leu Leu Asp Phe Asn 

40 

AAC CTC AAT GGG GAA GAC CAA GAC ATT CTG ATG GAA AAT 
Asn Leu Asn Gly Glu Asp Gin Asp He Leu Met Glu Asn 

55 

AAC CTT CGA AGG CCA AAC CTG GAG GCA TTC AAC AGG GCT 
Asn Leu Arg Arg Pro Asn Leu Glu Ala Phe Asn Arg Ala 

70 

GTC AAG AGT CTG CAA AAT GCA TCA GCA ATT GAG AGC ATT 
Val Lys Ser Leu Gin Asn Ala Ser Ala He Glu Ser He 

85 

CTG AAA AAT CTG CTG CCA TGT CTG CCC CTG GCC ACA GCT 
He Lys Asn Leu Leu Pro Cys Leu Pro Leu Ala Thr Ala 

100 

GCA CCC ACC AGG CAT CCA ATC CAT ATC AAG GAT GGT GAC 
Ala Pro Thr Arg His Pro He His He Lys Asp Gly Asp 

115 

TGG AAT GAA TTC CGC CGC AAA CTG ACC TTC TAT CTG AAA 
Trp Asn Glu Phe Arg Arg Lys Leu Thr Phe Tyr Leu Lys 



ACC CTG GAG AAT GCT CAG GCT CAG CAG ACC ACC CTG AGC 
Thr Leu Glu Asn Ala Gin Ala Gin Gin Thr Thr Leu Ser 

130 

CTC GCG ATC TTC TAG 
Leu Ala He Phe Stop 
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FIGURE 2a 



1 10 



ATG 
MET 



ACC CCC CTG GGC CCT GCC AGC TCC CTG CCC CAG AGC TTC 
Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe 

20 

CTG CTC AAG TGC TTA GAG CAA GTG AGG AAG ATC CAG GGC 
Leu Leu Lys Cys Leu Glu Gin Val Arg Lys He Gin Gly 

30 

GAT GGC GCA GCG CTC CAG GAG AAG CTG TGT GCC ACC TAC 
Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala Thr Tyr 



40 



50 



AAG CTG TGC CAC CCC GAG GAG CTG GTG CTG CTC GGA CAC 
Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His 

60 

TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC CCC 
Ser Leu Gly He Pro Trp Ala Pro Leu Ser Ser Cys Pro 

70 

AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC 
Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu 

80 90 
CAT AGC GGC CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC 
His Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala 

100 

CTG GAA GGG ATC TCC CCC GAG TTG GGT CCC ACC TTG GAC 
Leu Glu Gly He Ser Pro Glu Leu Gly Pro Thr Leu Asp 

110 

ACA CTG CAG CTG GAC GTC GCC GAC TTT GCC ACC ACC ATC 
Thr Leu Gin Leu Asp Val Ala Asp Phe Ala Thr Thr He 



120 



130 



TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT GCC CTG 
Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala Leu 

140 

CAG CCC ACC CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT 
Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala 

150 

TTC CAG CGC CGG GCA GGA GGG GTC CTG GTT GCC TCC CAT 
Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His 
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FIGURE 2b 



160 

CTG CAG AGC TTC CTG GAG GTG TCG TAC CGC GTT CTA CGC 
Leu Gin Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg 



170 174 
CAC CTT GCC CAG CCC T 
His Leu Ala Gin Pro 
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FIGURE 3a 

1 10 
Met Ala Pro Pro Arg Leu He Cys Asp Ser Arg Val 
5'ATATG GCA CCA CCA AGA TTA ATT TGT GAT TCT AGA GTA 
TAC CGT GGT GGT TCT AAT TAA ACA CTA AGA TCT CAT 



Leu Glu Arg Tyr Leu Leu Glu Ala Lys Glu Ala Glu Asn 
TTA GAA CGG TAC CTC TTG GAG GCC AAG GAG GCC GAG AAT 
AAT CTT GCC ATG GAG AAC CTC CGG TTC CTC CGG CTC TTA 

25 

He Thr Thr Gly Cys Ala Glu His Cys Ser Leu A_sn Glu 
ATC ACG ACG GGC TGT GCT GAA CAC TGC AGC TTG AAT GAG 
TAG TGC TGC CCG ACA CGA CTT GTG ACG TCG AAC TTA CTC 

40 

Asn He Thr Val Pro Asp Thr Lys Val Asn Phe Tyr Ala 

AAT ATC ACT GTC CCA GAC ACC AAA GTT AAC TTT TAC GCG 

TTA TAG TGA CAG GGT CTG TGG TTT CAA TTG AAA ATG CGC 

55 

Trp Lys Arg Met Glu Val Gly Gin Gin Ala Val Glu Val 
TGG AAA AGA ATG GAA GTT GGC CAG CAG GCT GTA GAA GTA 
ACC TTT TCT TAC CTT CAA CCG GTC GTC CGA CAT CTT CAT 

70 

Trp Gin Gly Leu Ala Leu Leu Ser Glu Ala Val Leu Arg 
TGG CAG GGA TTA GCG CTA TTA AGT GAA GCT GTT CTC CGC 
ACC GTC CCT AAT CGC GAT AAT TCA CTT CGA CAA GAG GCG 

85 

Gly Gin Ala Leu Leu Val Asn Ser Ser Gin Pro Trp Glu 
GGT CAG GCT TTA TTA GTC AAC TCT TCC CAG CCG TGG GAG 
CCA GTC CGA AAT AAT CAG TTG AGA AGG GTC GGC ACC CTC 



Pro Leu Gin Leu His Val Asp 
CCC CTG CAG CTG CAT GTG GAT 
GGG GAT GTC GAC GTA CAC CTA 



Arg Ser Leu Thr Thr Leu 
CGC AGC CTC ACC ACT CTG 
GCG TCG GAG TGG TGA GAC 



100 

Lys Ala Val Ser Gly Leu 
AAA GCC GTC AGT GGC CTT 
TTT CGG CAG TCA CCG GAA 

115 

Leu Arg Ala Leu Gly Ala Gin 
CTT CGG GCT CTG GGA GCC CAG' 
GAA GCC CGA GAC CCT CGG GTC 
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FIGURE 3b 
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